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ABSTRACT. We study the fluctuations, in the large deviations regime, of the longest 
increasing subsequence of a random i.i.d. sample on the unit square. In particular, 
our results yield the precise upper and lower exponential tails for the length of the 
longest increasing subsequence of a random permutation. 



§. 1 Introduction 

Let {Zi}™ =1 = {(Xi,Yi)}f =1 denote a sequence of i.i.d. random variables with 
marginal law /i on the unit square Q = [0,1] 2 . Throughout, we make the assump- 
tion that fi possesses a strictly positive density p £ C 1 (Q) with respect to the 
Lebesgue measure A on Q. 

A subsequence {Z^, Zi e } C {Zi}^ =1 is called a monotone increasing subse- 
quence of length £, if 

< X ij+1 and Yi. < Y ij+1 , for j = 1, ...,£- 1. 

Define next i ma , x {n) to be the length of the longest increasing subsequence in the 
sample {Zi}™ =1 . Note that we do not require that ij < ij+i- 

In the case that /i, = A, f max (n) possesses the same law as the length of the longest 
increasing subsequence of a random permutation, denoted hereafter by L max (n). 
Building on the fact that 

lim ma *( - =2 in probability, 



c.f. [10], [11], we showed in [3] that 

(1.1) lim ma ^. - = 2J M in probability, 



where J M £ M + is the solution to the variational problem 



1.2) J M = sup / ^p(x,4>{x))(j>(x)dx 

4>eBi Jo 
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with 

= {(f> : [0, 1] — ► [0, 1], non-decreasing, absolutely continuous}. 

Furthermore, it follows from [3] that any longest increasing subsequence will con- 
centrate on the solutions to the variational problem (1.2). See [4], Proposition 4.4, 
for an alternative expression for J M . 

Note that J M = 1 for p(x, y) = 1, in which case the maximum is achieved on the 
diagonal x — ► <f>(x) = x. 

The fluctuations of i ma x.{n) and L m ax(ra) are highly nontrivial and have been 
investigated in several papers, c.f. [2], [9], [1], [5]. In particular, Aldous and 
Diaconis have exhibited quite different behaviors in their upper and lower tails. 
Our goal in this paper is to provide information on the large deviations of these 
fluctuations. The results and techniques differ sharply in the study of lower and 
upper tail, and we divide the discussion in the rest of this introduction between 
these two cases. 

Turning our attention to the lower tail we first show in Theorem 1 that for any 
-2 < c < 0, 

(1.3) lim -logP(L max (n) < (2 + c)v^) = -2H (c), 

n^oo n 

with an explicit function H , first introduced by Logan and Shepp in [6], 

1 (2 + c) 2 c + 2 , (c + 2) 2 Al , 2(c + 2) 2 > 

^ = -2 + ^^ + lQg ^'( 1 + — — ^° g Q + (c + 2) 2 )- 

See Fig. 1 for a plot of H (-). The proof based on the random Young tableau 
correspondence is purely combinatoric and sheds no light on the random mechanism 
responsible for the large deviations. In particular, we cannot prove that, conditioned 
on L max (n) < (2 + c)yjn, the longest increasing subsequence concentrates around 
a curve </>(•). 

While we could hope to use this result in order to prove an exponential lower 
tail for general \i, that is 

(1.4) lim - logP(C ax (n) < (2J M + c)y/n) = -2H^c), 

with H^(c) > for —2J fl < c < 0, we were not able to compute explicitly, nor 
to prove the existence of the limit in (1.4). We thus present in Propositions 2.1 and 
2.2 nontrivial upper and lower bounds on the left hand side of (1.4), avoiding the 
question of existence of the limit. 

The situation is quite different for the upper tail, here an easy sub-additive 
argument shows that for c > 

(1.5) lim log P(L max (n) > (2 + c)v^) = -U (c), 



for some nontrivial convex rate function Uo(-). In the first version of this work, we 
presented only bounds on Uq(-), leaving open the explicit evaluation of this function. 
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Subsequently, T. Seppalainen has proved in [8], using Hammersley's particle system 
associated with the Poissonized version of L max (n), that 

U (c) = (3{c) := 2(2 + c) cosh- 1 (c/2 + 1) - 4 ^c 2 + 4c . 

See Fig. 2 for a plot of Uo(-). In fact, Kim [5] had already observed, by combinatorial 
techniques, the upper bound in (1.5) with the function Uq{c). For the sake of 
completeness, we will present in Section 3 a combinatorial proof of the lower bound 
in (1.5). 

Our interest is in exploring the similar question for £ max (n), where the sub- 
additive argument is not applicable. Our main result in this direction (c.f. Theorem 
3) is in fact that 

(1.6) lim -L logP(£ max (n) > (2J M + c)y/n) = -U„{c) 

where 

Moreover we show that, under the conditioning that 

{WW > (2J M + c)Vn}, 

the longest increasing subsequences concentrate near the maximizing curves in (1.2). 

The precise statement and proof of these theorems is followed in section 4 by a 
discussion and several conjectures and open questions. 

Acknowledgment We thank JH Kim for pointing out to us that Hq{c) is an 
upper bound in (1.3), A. Dembo for useful discussions, and T. Seppalainen for 
sending us a copy of [8], where a probabilistic proof of (1.5) was first provided. 

§. 2 The lower tail 

In this section we first describe the large deviations for the uniform measure 
p(x,y) = 1. Let us recall some notation from [6]: For / 6 T , the class of nonneg- 
ative, nondecreasing functions on [0, oo) of unit integral, define 

r°° rf( x ) 

(2.1) H(f)= / log(f(x)-y + f- 1 (y)-x)dydx. 

■Jo Jo 

Let, for c € (-2,0), 

H (c) = mi{H(f), / G ^,/(0) = 2 + c} + 1 
c.f. [6]. Then H(f) > 0, and 

( , 2) ff0(cH 4 + (^ +log ^_ (1+ (^ )Iog( _^ ) . 

Note that Hq is a strictly convex, monotone decreasing function with minimum 
at c = 0, c.f. Fig. 1 below. Our first result, which is an immediate consequence of 
[6], is based on Schensted's identity: 
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Theorem 1. For any — 2 < c < 0, 

lim -logP(L max (n) < (2 + c)Vn~) = -2H (c). 



Proof. The basic idea is to use a combinatorial identity of Schensted, expressing 
the probability distribution of L max (n) in terms of Young tableaux, c.f. [7] and [6], 
§1: 

A Young shape r of size |r| = n is an array of n unit squares s, left and bottom 
justified, whose columns' lengths are nonincreasing from left to right. The hook 
length c(s) of a square s in the shape r is just the number of squares in r directly 
above and to the right of it, counting each square exactly once, c.f. Fig 3. Let 
7T ( T ) = llseT a ( s ) denotes the hook product, i.e. the product of all hook lengths in 
the tableau r. Then, the Schensted identity states that 

v — v n\ 

(2.3) P(L max (n) = k) = 2^ - — , k = l,...,n, 

r:r(0) = fc 

where the sum is taken over all shapes r containing n squares, possessing a first col- 
umn of length k. In order to estimate P{L max (n) < (2 + c)y/n) for fixed c 6 (—2, 0) 
it suffices to find an optimal shape t„ with t„(0) < (2 + c)y/n which maximizes the 
hook product 7r(r„). This is in essence the argument of [6] which yields the upper 
bound, c.f. (1.9), (1.10) and (3.2) there. We hence concentrate in the sequel in 
proving the lower bound 

liminf - log P(L max (n) < (2 + c)y/n) > -2H (c). 

n— >oo n 

Our goal is to find for fixed c 6 (—2, 0) a sequence of shapes {r„} of maximal hook 
product such that lim^oo \r n \/n= 1 and lim^oo T n (Q)/n l l 2 < (2 + c). Let f§ € T 
be such that 

tf(/ c ) = m£{H(f) : f€F with /(0) = 2 + c}. 
The curve f§ is constructed in [6], it has the support 

1 (2 + c ) L (2 + c) 2 
6o(c)= (2T^)- L X J + V 2 + L -2-^- 

Hence, the length of the curve {(x, /q(x)),0 < x < bo(c)} is bounded by some 
constant k c . 

We construct a particular Young tableau out of f§. For i = 1, [bo(c) y/n] = 
i max set = [/o( 7^ Note that j(z) is a decreasing sequence, and, because 
the length of {(x, f§(x)), < x < &o(c)} is bounded, 

m n = ^ ^ 1 > n — k c \fn. 

i=i j=i 
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The sequence i = l,...,i max } defines a Young tableau r n of size m n . More- 

over for any y < f§(x) such that i = [x^/n\ < i max and j = [y\/n\ < j(i), denoting 
by 7Tjj the hook length of the square with indices (i, j) , 

logTT^TvJ < log(/ c (x) -y+ (/ c ) _1 (y) -x)+ logn. 

Hence, for some constant C > independent of n, whose value may change from 
line to line, 



nH(ti)=n r° (C) [ f0iX) log {f£(x) - y + (fc)-\y) - x) dxdy 
Jo Jo 




It follows that for any — 2 < c < 0, 

P(W™n) < (2 + C )v^) > -^2 > ^fn! e - 2 " 1 °^ e -C^lo g n e -2nH(/=) 

(7r(r„)) 2 n! 

> e -Cv^logn e -2n[H(/ c ) + i]^ 

Finally, for any c < 2, by rescaling, 

P(L max (n) < cy/n) < P(L max (m n ) < cy r m^) = P(L max (m n ) < cy/n — -=^) , 

\/n 

and the conclusion follows from the continuity of H(f§) in c. □ 

An immediate corollary, which will be useful below, is the following: 
Corollary 1. For any — 2 < c < there exists a function r)(c,5) satisfying 

lim 7i(c, 5) = 

such that ifp(x,y) satisfies (1 — d) < p(x,y) < (1 + S) then 

limsup|-logP(£ max (n) < {2 + c)y/n) + 2H (c)\ < r)(c,6) 

n — >oo n 

Proof. The proof is based on the same idea as the proof of Lemma 7 in [3] . By a 
possible change of coordinates in the x axis, we may and will assume that p(x) = 
Jo p( x >y)dy = 1, and that \p{y\x) — 1| < 5' = 25/(1 — 5). Let Pj be the law on [0, 1] 
with density p(y\Xi). Note that Pj may be written as a mixture of a uniform law 
(with weight (1 — 5')) and another law on [0, 1], depending on Xi and denoted qi, 
that is Pi(dy) = (l-5')X 1 (dy) + 5'q i (dy). Thus, the sample ((X 1 ,Y 1 ),.. . , (X n ,Y n )) 
possesses the same law as Z n = ((Xi, {m\U\ + (1 — mi)Wi)), . . . , (X n , (m n U n + 
(1 — m n )W n ))), where {Ui}2 =1 is a sequence of i.i.d. uniform random variables, 
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independent of the sequence {Xi}f =1 , {mj" =1 is a sequence of i.i.d. Bernoulli (1— 5') 
random variables, independent of the sequences {Ui}f =1 and {A^}™ =1 , and {Wi}™ =1 
is a sequence of random variables whose law depends on the sequence {Xj}™ =1 . Let 
/ denote the set of indices with m; = 1, and let N n = J27=i ^rn i= i = \I\ denote the 
number of indices where a uniform random variable is chosen in the mixture. Note 
that one may find a 5" = 5" (5) > 5', 5" (5) ^ s ^ such that 
(2-5) 

limsup -\ogP(N n /n<l-S") = -\(l-S") log + 5" log < -3H (c) 

n^oo n V 1 — 0' 0' ) 

for all 8 small enough. Let £ max (n) denote the length of the maximal increasing 
subsequence corresponding to Z n , then £ ms , x (n) possesses the same law as ^ ma x(^) 
and, on the other hand, is not smaller than the length of the maximal increasing 
subsequence when one considers only those indices i G I. The latter is distributed 
precisely as the length of the maximal increasing subsequence of a uniform sample 
of random length N n which is independent of the uniform sequence. Therefore, 
(2.6) 

P(Wn) < (2 + c)y/n) < P(L max (n(l - 6")) < (2 + c)y/n) + P{N n /n < 1 - 5") . 
The continuity of H (c) implies that for 5 small enough, 

2H ( 2 + C - 2) < 3H (c) . 
VI - 5" 

Hence, (2.5), (2.6) and Theorem 1 imply that for 5 small enough, 

limsup P(Cax(n) < (2 + c)y/n) < -2H ( lll^ - 2) = -2H (c) + g(c, 5) , 

n^oo Vl — 0" 

where the continuity of Hq(-) implies the required properties of g(c, 5). The com- 
plementary lower bound is proved by a similar coupling. □ 

We now turn to general case and prove first a lower bound estimate: for fixed 
d > set 

(2.7) I^ = M{H(u\fi):ueMi(Q),2J u = d}, 

where .Mi(Q) is the set of probability measures on Q and H(v\ii) denotes the 
relative entropy of v with respect to /i: 



H{y\n)= [ \ og ^y\u(dx,dy) 

Jq p{x,y) 



if ^ = q and H(y\n) = oo otherwise. 

Although an explicit computation for 1^ seems impossible, it is quite easy to 
verify that I^(d) = for d > 2J M , and < I^(d) < oo if < d < 2J fl , (e.g., by 
combining Lemma 1 and Proposition 2.2 below). Note that (1.1) implies that under 
Q = Y[ v, for each e > 

lim Q(£ max {n) < [2J V + e)y/n) = 1. 



Using a standard change of measure argument, we get from this: 
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Proposition 2.1. For fixed < d < 23^, 

liminf - logP(£ max (n) < d^/n) > -iJd). 

n—>oo n 

However, a simple comparison with 2H in case [i = A shows that I\ (2 — •) is 
not the correct rate function: 

Lemma 1. Take = X, then 

(2.8) hminf^^>l 

<5\,o 5 3 9 

Proof. Assume the existence of v§ such that 2 J vg < 2 — 5 but lim^o H(v&\ \)/5 3 < 
4/9. For a fixed K > (independent of 5), let qs(x,y) = diss / d\(x , y) , and denote 

A* = {(x,y) G Q :g 5 (x,y) < (1 + <$#)}■ 

One easily checks that \{A K ) < 5/K 2 . Thus, we may assume that qs{x,y) = 
1 + 5m(x,y) for some m which, on Ak, is bounded above by K. Consider the set 
of curves x : (0, 1 — y) — ► 4>y( x ) = y + x where < y < 5/3. Then 

1 — - > Ju(4> y ) = / \A + 8m(x,y + x) dx 
* Jo 

5 f 1 ~ y 

>(l-y) + - (K Am(x,y + x))dx + 0(5 2 ) 
2 J o 



and therefore 



<J 1 

3 ~ 6 



5 2 > / JA<Py)dy 
Jo 

5 1 5 I" 5 / 3 f 1 ~ v 

-3~i8 2 Jo Jo (K Am ( x >v +x )) dxd v + °( 6 )- 



Thus 



/■<5/3 

(5/ / {K Am(x,y + x))dxdy < -25 2 /9 + 0{5 3 ) , 
JO ./o 

and, by symmetry, for As = {(x,y) € Q : —5/3 < x — y < 5/3} H Ak, 

u(A s ) < X(A S ) - 45 2 /9 + 0(5 3 ) , 

with limx^oo lim^^o A(Aa)/5 = 2/3. Now the infimum of H{y\\) under the above 
condition is achieved at the constant density 1 — 5' , where 

A(A 5 j 

on A5, and ^x^oo is a constant independent of 5 whose value may change 
from line to line. Substituting in H(u\\), one obtains 

H{v s \\) > A5 3 /9 + g K 5 3 + 0(5 4 ) . 
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Taking the limits as 8 — > (first) and then K — > oo yields a contradiction. □ 

Note that our argument is quite rough and with additional work one could pos- 
sibly identify the constant b 6 [|, §] such that lim^o ^^T^ = b, but this is quite 
irrelevant since a simple computation shows 2H (0) = 2H' Q {Q) = 2H (0) = and 
2H (0) = \ and therefore 

2goW =± 4 J A (2 
<5to (5 3 1 2 9 ~ 5\o (5 3 

Our next result shows a volume upper bound: 
Proposition 2.2. Lei c < 0, then 

limsup - logP(^ max (n) < (2J M + c)Vn) < 0. 

71— S-OO 

Proof. Let denote an optimizer in (1.2) (whose existence is ensured by [3]). Fix 
A > with A -1 an integer, and for i = 1, . . . , A -1 let 

Qi = [(i-l)A,iA]x[<l>((i-l)A),<f>(iA)], Pi = A(0(iA)-0((i-l)A))p(iA,0(iA)). 

Set n, = npj, and let m, denote the (random) number of points in the sample 
{Zi}f =1 which belong to Q^. Then, for any e > 0, by Sanov's theorem, for A > 
small enough, 

(2.10) limsup - log P(U^ 1 1 ^ i (e)) < where ^(e) = {|n; - m;| > en;}, . 

n — >oo n 

Let tiirrii) denote the length of the longest increasing subsequence among the wi; 
points in Qj. Let P mi denote the law of the sample in Q i; conditioned on m;. Then 
P m% possesses a product law with density Pi, satisfying lim^^o sn P x ,yeQi \Pi( x i V) ~ 
1 1 = 0, c.f. Lemma 2 of [3]. Corollary 1 implies that for any e' > and all i, and 
all A small enough, 

limsup — log P mt (£i (mj) < 2(1 - e')^/rh~) < 0. 
Hence, for A small, on rij{|nj — m;| < en;}, 

(2.11) limsup - log P(^(mi) < 2(1- e') V™7 IMzii 1 ) <0. 

n— s-oo Tl 

Note that (c.f. [3]) Yli \J~P~i "^A^o «^u- Choose A small enough such that | Yli y/P~i~ 
J„| < |c|/2. Then, 

P(C ax (n) < (2J M + c)V^) < P(ut^^(e)) 

+ J B[uf =1 1 P(^(m l ) < 2 v ^- + cA v ^/2|{m,});nf =1 1 A l ( e ) C ] 
< A -1 maxP(|n; — m;| > en;) 

i 

+ A _1 max max P mi (^(m;) < 2 v / m7(l + e) + c' y/mi) , 

i m,:|mi- m <erii 
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where c' < is independent of e and A. Choosing now e small enough such that 
2e + c' < and using (2.10) and (2.11), the proposition follows. □ 

Remark: It is instructive to relate f max (re) to J„ for an measure v associated 
with R n = — Y^i=i &Zi-> ^ ne empirical measure of the sample. To this end, define 
for e > 0, the random measure R Uye with constant density e -2 on the squares 
Q e (Z t ) = [-|,f] 2 + Z h i = l,...,n, that is 

d R 1 v n 

i=i 

Note that, P almost surely e n = \ mini< i<: ,< n (|Xi — Xj\ A |Yj — Yj\)>0.A simple 
computation shows 

(r, -i \ ^max( n ) j 

(2-12) —j=- = J R ^ n , 

and therefore {l ma x( K ) < = { Jn n en < d}, for each d > 0. However a deriva- 

tion of the large deviation principle using this equality fails, due to the discontinuity 
of the mapping v — ► J v . In particular, R n ,e n converges weakly to /j, on the other 
hand, we have lim^^ ^ m ^ w) = 2J M . 

§. 3 The upper tail 

Here the situation is quite different from the lower tail, and in some sense much 
simpler. Our first result is: 

Theorem 2. For allc>0 

(3.1) lim -L log P(L max (n) > (2 + c)v^) = -U (c), 

n^oo fx I 

where Uq : [0, oo) — > [0, oo) is a continuous, strictly increasing convex function 
with Uq(c) = iff c = 0, and 

(3.2) U (c) = (3(c) := 2(2 + c) cosh- 1 (c/2 + 1) - 2^c 2 + 4c. 

Note that Uq(c) = 0(c 3 / 2 ) as c — ► 0, this is also predicted by the behavior 2Hq(c) = 
0(c 3 ) as c — > . The explicit computation of C/o(c) was first done in [8] using 
Hammersley's particle system. 

Proof. As pointed out by [1], the convergence in (3.1), and the convexity and mono- 
tonicity of Uq follows from sub-additivity. We briefly recall the argument. Let M n 
denote the number of points in a Poisson point process of rate A„ = n\ on the 
unit square, and let L m ax(A/' n ) denote the longest increasing subsequence in that 
sample. Then, for any e > 0, a direct computation using the Poisson distribution 
yields 

(3.3) lim sup -^= log P(\M n /n - 1| > e) = -oo . 

n— s-oo v n 
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On the other hand, conditioned on J\f n , the law of the sample is uniform and 
hence 

P(L max (N n ) = x\M n =m) = P(L max (m) = x) . 

Therefore, 

P(Lm^(M n ) > (2 + c)y/n)/P{M n > n(l - e)) > P(L max (n(l - e)) > (2 + c)y/n) 
while 

P(L max (M n ) > (2 + c)Vn) - P{M n > n(l + e)) < P(L max (n(l + e)) > (2 + c)y/n) , 

which implies (using (3.3)) that (3.1) holds as soon as it holds with L max (Af n ) 
replacing L max (n). 

On the other hand, consider the squares Q 1 = [0,y/n/(y/n + ^Jrn)] 2 and Q 2 = 
Wn/{y/n+y/m), I] 2 Q Q, and denote by ^LxC^vn+v^) 2 ) and £Lx(^(Vn+VJS) a ) 
the length of the longest increasing subsequence in the squares Q 1 and Q 2 , cor- 
responding to A/( /^)2 . The scaling and independence properties of the Pois- 
son process imply that imax(-^(vn+v™) 2 )' * = are hidependent, and that 
the laws of ^wC^y^+v^) 2 ) and L max (Af n ), respectively iL^^+^O and 
{Mm)-, are identical. Therefore, since 

^max(A/(^; +v /i)a) > L max (W( y^) 2 ) + £ max (A/(^ +v /^)2 ) ' 

we deduce that 

P(imax(AA (v ^ +v ^ )2 ) > (2 + C )(v^+ v^)) 

> p (iL(^(^+ v M^) + 4x(^( V 5 + ^) > (2 + c)(y^+ v^)) 

> f(iiax(^(^ + ^=) > (2 + c)v^(£max(A/'(^ +v ^)2) > (2 + C ) y^) , 

= P(L max {Af n ) > (2 + c) \/n)P(L max (J\f m ) > (2 + c)v^) , 
which immediately implies the existence and convexity of the limit 

U (c) = lim \ logP(Z max (A/' fc 2) > (2 + c)k) . 

Next, since 

P(L max (Af {[VE]+1) 2 > (2 + c)y/n) > P(L max (Af n ) > (2 + c)y/n) 

> P{L max {M [sM 2 > (2 + C )y/n) , 

(3.1) follows with ?7o = E/o- It thus remains only to explicitely compute Uq(c). 

In fact, Kim has already observed that Uq{c) > /3(c), c > 0, see equation (1.6) 
in [5]. We thus concentrate in the sequel in the reverse inequality. The proof is 
constructive: we exhibit an appropriate collection of Young shapes. 

Fix n large enough, e > small and c > 0, and let T c ^ n denote the set consisting 
of Young shapes of size n c = n — \(c + e)^Jn\ . Recall the function f^eF defined 
in Section 2, and define 

T c % = {t£ T c , n : |r(0) - 2y^| < ev^,sup \n-^ 2 T(xy/rQ - f§(x)\ < e} . 
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It follows easily from [5], [6] and (2.3) that for n large enough, 

(34) y nJ >- 

For each r G T^ n , define a new Young shape r' obtained by increasing the height 
r(0) by \{c + e)y/n\. Note that r'(0) > (2 + c)Vn while \r'\ = n. From (2.3), we 
have 



P(L msx (n) > (2 + c)y^) > £ 



n! 



(vr(r')) 



2 



n! Y" n c ! / tt(t) \ 
«c! c ^ (^(r)) 2 W(r')i 



2 



For r € T c <; n , 



^) = / r(c+ ^ a T ^ (T(o) + r( C + e )v^-j+r(j)) 

V fi (r(0)-j + r(j)) 

Note that, due to the definition of T^ n , 

, TT (r(0)+r(c + e )V^-J+r(j)) 
S }i (r(0)-i + r(j)) 

<V^[J o log(2 + c-x + / °(x))dx-^ log(2-x + / °(x))dx + C £ ] + o(Vn) , 

where C e -^ e ^o does not depend on n. Using the change of variables x — fo(%) = 
£,x = /i (0 and 5o (£) = MO - 1 + «0ti(O)/ 2 as in Pg. 212 of [6], one obtains 
after some manipulations that 

! log(2 + c-x + f G {x))dx- [ log(2-x + / °(x))dx 

^(0(log(2 + c - - log(2 - OR + / (log(2 + c - - log(2 - OR 

-oo ■/ 

(3.5) 

= ng (2 + c) - 7T5o (2) + (2 + c) log(2 + c) - c log c - 2 log 2 , 

where go denotes the Hilbert transform of go and is given by (2.31) in [6]. Note 
however that by (2.22) in [6], 7r<? (2) = 2 - 2 log 2, while (2.22) and (2.32) in [6] 
imply 7T5o(c + 2) = (2 + c) - (2 + c) log (2 + c) + 13(c)/ 2 . Substituting in (3.5), and 
then using (3.4), one concludes that 

liminf -L logP(L max (n) > (2 + c)y/n) > -(/3(c) + 2C e ) . 



Taking e — ► yields the desired conclusion that Uo(c) < (3(c) for c > 0. □ 

The following corollary follows from Theorem 2 in the same way that Corollary 
1 followed from Theorem 1: 
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Corollary 2. For any c > there exists a function rj(c, 5) satisfying 

lim fj(c, 5) = 

such that if p(x,y) satisfies (1 — S) < p(x,y) < (1 + S) then 

limsup|-^logP(4iax(n) > (2 + c)^) + U (c)\ <fj(c,S). 

n— s-oo y n 

Let if C be the set of solution to the variational problem (1.2). 
Theorem 3. For allc>0 

(3.6) lim logP(^ max (n) > (2J M + c)VH) = -J„U (c/J^ . 

Next assume that K = {<^>i, </> r }. T/ien /or eac/i (5 > and longest increasing 
subsequence Z max = {(X^.,^.), j = 1, ...,£ max (n)} , 

lim P(min max |Fj - cj> a (Xi )\ < S\£ max (n) > (2J„ + c)^/n) = 1. 

n^oo q = 1 j' = l 

Proof. We begin by providing a lower bound in (3.6). Let 4> denote a maximizer 
in (1.2), and define A, pi,ni,rrii,Qi, li(rrii) be as in the beginning of the proof of 
Proposition 2.2. Fix 5 > 0, and reduce A if necessary. By Sanov's theorem, 

limsup ■ logP(|ni — mj| > 5ni) = — oo . 

n— s-oo Tl ' 

Hence, 

liminf -L- logP(^ max (n) > (2J M + c)y/n) > 

1 A " 

liminf -— log P(V^(mi) > (2J M + c)aA; {|n; - m;| < c^}) . 

n^oo 77, / — 
i=l 

Next, for each i and ti > 0, using lmiA->o sw P x ,yeQi \Pi( x jU) — 1| = 0> one has by 
Corollary 2 that for some 5' (A) -^a^o 0, 

(3.7) | lim —J— logP ro .(4(mi) > (2 + U)^) + U (U)\ <fj(U,6'). 

m.^oo 777, ' 

(P mz is the law of the sample in conditioned on m,). 

Recall pi = ni/n = A(4>(iA) — 4>((i — l)A))p(iA,(/)(iA)), and fix a sequence 

{U > 0}t=i such tnat 

A" 1 

(3.8) 2(2 + ti)V(l-^i > (2J M + c). 

i=l 
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Then, 



^lo g P(^^(m l ) > (2J At + c) v ^;n i d 1 {|n i -m i | < 6m}) 
i=i 

1 A " 

l°g . inf P(^^K)>(2J M + C )v / ^|{m J }) 

{mi:|ni-mi|<ini) — * 



> 

n 



i=i 

A- 1 



^ log n P ».( 1 -«)(^ ra ') ^ (2 + t i )V(l-«y)n i ) 



n 

i=i 



where the last inequality is a consequence of (3.8), the monotonicity of li(mi) in 
rrii, and of the (conditional in {raj}) independence of the £i(rrii). Thus, combining 
the above with (3.7), one concludes that 



I A_1 
liminf — logF(Cax(n) > (2J M + c)V^) > - V y/ Pi (l ~ 6)(U (U) + fj(U, 8')} . 



i=l 



Since the last bound is valid for any choice of {ti} satisfying (3.8), we conclude that 
liminf — --- logP(^ 

max 

(3.9) 

A" 1 A- 1 

-mi{Y,Jpi{Uo{ti) + n{ti,5')) :t> 0,^(2 + ti ) y/^L^S)^ > (2J M + c)}. 

i=l i=l 
A -1 — _ 

Recall (c.f. [3]) that J2i=i \[pi "^A^o J^- The smoothness of 4> proved in [3] 
and (3.9) imply therefore, by taking the limit A — > in the right hand side of (3.9), 
that 

liminf -—pr logP(£ max (n) > (2J fl + c)y/n) 

> - inf { f Jj>(x)p(x,<l>(x))U (t(x))dx : 
J o 

(3.10) t € C([0,1],R+), f t(x)Jj>(x)p(x,<l>(x))dx > c} . 

Jo 



Making the change of variables dy = y <f>(x)p(x, <f>(x))dx, t(x) — > t(y), the right 
hand side of (3.10) becomes 

(3.11) -inf{ / "u (t(y))dy : i € C([0, J M ], R+), / " t{y)dy > c} . 

Jo Jo 

Take now t(y) = c/J^ to conclude from (3.10) that 

liminf logP(£ max (n) > (2J M + c)y/n) > -J U (-^-) . 
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The proof of the complimentary upper bound is only slightly more complicated, 
and involves the same tools as in [3]. Let A y << A, with A" 1 an integer. Define 

a "block curve" as an integer valued sequence , satisfying j(i + 1) > 

and j(A~ 1 )A y < 1. Let B A denote the set of all possible block curves, and note 
that the cardinality of B A is finite. To any block curve b 6 £> A associate naturally a 
(piecewise linear) curved, and define Qi = [{i— 1)A, iA] x 1)A 3/ , + l)A y ], 
fhi as the number of points within Q^, p, = A(j(i) — j(i — 1) + l)A y p(iA,j(i)A y ), 
fii = npi and £i(b, n) as the length of the longest increasing subsequence within Q^. 

Clearly, i max (n) < max beB A Yli=i ti(b,n). Hence, 

4to logP(W(n) > (2J M + c)Vn) < 

A- 1 



n 



1/2 



i=l 



Fix S > small. Repeating the argument used in the proof of the lower bound, one 
finds (reducing A, A y / A if necessary, but independently of n) that 



1 A 

limsup^ logP(5^(6,n) > (2J M + C )v^) 



nV2 

i=i 



A- 1 A- 1 

< - mf{J2 VFiiUoiU) + v{5',U)) :U>0,J2(2 + U))y/pi(l + 6) > 2J M + c} . 

i=i i=i 

Let Jcj, = \J <j)(x)p(x, 4>(x))dx. With A small enough, 

A- 1 

i=i 

Hence, taking now first n — > oo and then A — > 0, followed by 5 — > 0, one concludes 
that 

limsup — ^ logP(^ max (n) > (2J M + c)>/n) 
< — hnMnf{ J \J (p(x)p(x, 4>{x))Uo{t{x))dx : 

(3.12) t(-) > 0, f t(x)Jj>(x)p(x,<i>(x))dx > 2(J M - J^) + c) . 

J o 

Making the same change of variables as in the proof of the lower bound, the right 
hand side of (3.12) equals 

/ = inf inf{ \' h U (i(y))dy : ['* t{y)dy > c + 2(J M - J^)} 
</>e-B T Jo Jo 

r A- IT 

(3.13) > inf {J^o(-2+-V^)}, 
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where the last inequality follows from the convexity of Uo and Jensen's inequality 
Let x = — 2 + c+ j J ^ , Then, since < J M , x > c/J^. Hence, using again the 
convexity of Uq and the fact that Uq(0) = 0, cUq{x) > J^xUq(c/ J^) and hence 

(3.14) J+Uoix) > ^^C/ (c/J M ) > J^Uo(c/J^) , 

with the second inequality being strict unless = J M . (3.12), (3.13) and (3.14) 
imply the required upper bound. 

Finally, the last statement of Theorem 3 follows from the fact that the inequality 
in (3.14) is strict unless = J M , and the fact that the assumption of finite K 
implies A4 of [3] (the proof is similar to the proof of Lemma 4 in [3] and is thus 
omitted). □ 

§4 Open problems and remarks 
We conclude this paper with a list of comments and open problems. 

1) We have left open the question of existence of limit in (1.4) and of the com- 
putation of H^{c). After a discretization as used in Section 2, maybe techniques 
borrowed from percolation may allow one to control the interaction between over- 
lapping "block curves" . 

2) We have seen in Theorem 3 that, under the conditioning {£ max (n) > (2J M + 
c)y/n}, (c > 0), any longest increasing subsequence concentrates along the solution 
to the variational problem (1.2). The corresponding question for {£ ma , x (n) < (2J M + 
c)y/n}, (— 2J fM < c < 0), remains unsolved, even in case fi = A. 

3) Under the assumption that K is finite, the strict convexity of Uq(c) implies 
uniqueness of the minimizing function t(-) in (3.10) and gives the profile of the 
longest increasing subsequence under the conditioning that an upper tail deviation 
occurred, for any fi. Indeed, for an optimal curve (f), the minimizing function t(x) = 
t ( / ) (x) in the variational problem (3.10) is readily seen to have the interpretation as 
the (local) fluctuation from the mean behavior, and strict convexity of Uq would 
imply that t(x) = c/J^, a constant. 

4) It is natural to ask what happens when Q is replaced by [0, l] d , d > 2. The 
subadditivity argument for the upper tail is the same, as well as the analog of 
Theorem 3 (with exponential speed n 1 ^, and functional J M as given in [3], page 
864) . What about the lower tail? The lack of a direct probabilistic proof of Theorem 
1, and the unavailability of the Schensted correspondence in higher dimension makes 
finding the analog of Theorem 1 challenging. One can still show, however, that the 
order of decay is exponential in n. 

5) As pointed out to us by P. Baxendale, it seems reasonable to expect that under 
the conditioning ^ max ( n ) > ot n / \fn — > oo, the maximizing subsequences con- 
centrate around the solutions of the optimization problem (1.2). For a n = n, this 
was proved in [3], and the technique of the proof seems to carry to the general case. 
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6) Let M n be the number of points of a Poisson point process on Q with intensity 
n\x and denote by lm ax (M n ) the length of the longest increasing subsequence of W n . 
Note that, conditioned on J\f n = m, the law of £ ms , x (J\f n ) is the same as the law of 
l max (m). Applying the same type of argument as in the first step of the proof of 
Theorem 2, one shows 

lim -i-logP(^ max (Ar„) > (2J M + c)v^) = -JMc/Jj. 

The corresponding result for the lower tail (in the case fi = A) can also be read off 
Theorem 1, c.f. [8]. Note that in this case the rate function does differ from the 
uniform case due to fluctuations in the number of points in the Poisson sample. 
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